Robust Methodology for TTS Enhancement Evaluation

Authors

  • Daniel Tihelka
  • Martin Gruber
  • Zdenek Hanzlícek
Abstract

The paper points to problematic and usually neglected aspects of using listening tests for TTS evaluation. It shows that simple random selection of the phrases to be listened to may not cover the cases that are relevant to the evaluated TTS system. It also shows that a reliable phrase set cannot be chosen without deeper knowledge of the distribution of differences in synthetic speech, obtained by comparing the output of the evaluated TTS system with that of a baseline system. Given such knowledge, a method is proposed for evaluating the reliability of listening tests, in terms of estimating how likely a conclusion derived from the listening results is to be invalid, and it is demonstrated on real examples.
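The coverage concern can be illustrated with a small calculation. The sketch below is not the method proposed in the paper; it only assumes a hypothetical corpus in which a known number of phrases are synthesized differently by the evaluated and baseline systems, and asks how likely a simple random sample is to contain none of them. The function name, the corpus size, and the counts are all illustrative assumptions.

```python
from math import comb


def prob_sample_misses_differences(num_phrases, num_differing, sample_size):
    """Probability that a simple random sample (drawn without replacement)
    contains no phrase whose synthesis differs from the baseline.

    Hypothetical helper for illustration only; not taken from the paper.
    """
    if sample_size > num_phrases:
        raise ValueError("sample_size cannot exceed num_phrases")
    unchanged = num_phrases - num_differing
    # Hypergeometric probability of drawing only unchanged phrases.
    return comb(unchanged, sample_size) / comb(num_phrases, sample_size)


# Assumed numbers: 1000 candidate phrases, 50 of which differ between the
# evaluated system and the baseline, and a listening test of 20 phrases.
p_miss = prob_sample_misses_differences(1000, 50, 20)
print(f"Chance that the test set contains no differing phrase: {p_miss:.3f}")
```

Under these assumed numbers, roughly one random test set in three would contain no phrase affected by the change, which is the kind of situation the abstract warns about.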

Similar resources

Investigating RNN-based speech enhancement methods for noise-robust Text-to-Speech

The quality of text-to-speech (TTS) voices built from noisy speech is compromised. Enhancing the speech data before training has been shown to improve quality but voices built with clean speech are still preferred. In this paper we investigate two different approaches for speech enhancement to train TTS systems. In both approaches we train a recursive neural network (RNN) to map acoustic featur...

Enhancement of Robust Tracking Performance via Switching Supervisory Adaptive Control

When the process is highly uncertain, even linear minimum phase systems must sacrifice desirable feedback control benefits to avoid an excessive ‘cost of feedback’, while preserving the robust stability. In this paper, the problem of supervisory based switching Quantitative Feedback Theory (QFT) control is proposed for the control of highly uncertain plants. According to this strategy, the unce...

Voicesetting: Voice Authoring UIs for Improved Expressivity in Augmentative Communication

Alternative and augmentative communication (AAC) systems used by people with speech disabilities rely on text-to-speech (TTS) engines for synthesizing speech. Advances in TTS systems allowing for the rendering of speech with a range of emotions have yet to be incorporated into AAC systems, leaving AAC users with speech that is mostly devoid of emotion and expressivity. In this work, we describe ...

Threshold shifts and enhancement of cortical evoked responses after noise exposure in rats.

The effect of exposure to various types of noise (broadband, high-frequency or low-frequency) was studied in adult pigmented rats. Thresholds and amplitudes of middle latency responses (MLR) recorded from electrodes implanted on the surface of the auditory cortex were analyzed before and after noise exposure. Exposure to noise with intensities ranging from 105 to 120 dB for 1 h produced only te...

Publication date: 2013